CDS

Accession Number TCMCG019C04313
gbkey CDS
Protein Id XP_022933842.1
Location join(2509206..2509304,2509384..2509453,2509650..2509747,2510321..2510462,2510550..2510650,2510828..2510911,2511066..2511116,2511213..2511278,2511355..2511420,2511515..2511595,2511677..2511754,2511905..2512075,2512533..2512611,2512800..2512918,2513156..2513512)
Gene LOC111441139
GeneID 111441139
Organism Cucurbita moschata

Protein

Length 553aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA418582
db_source XM_023078074.1
Definition putative clathrin assembly protein At5g35200 isoform X1 [Cucurbita moschata]

EGGNOG-MAPPER Annotation

COG_category TU
Description Clathrin assembly protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko04131        [VIEW IN KEGG]
KEGG_ko ko:K20043        [VIEW IN KEGG]
ko:K20044        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs GO:0005575        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005886        [VIEW IN EMBL-EBI]
GO:0016020        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0071944        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGTCAGGTGGGGGTACACAGAACAGCCTTAGAAAAGCTCTGGGAGCTCTGAAGGATACTACCACAGTTTCATTAGCTAAAGTTAACAGTGATTATAAGGAATTAGACATTGCTATAATTAAGGCAACAAATCATGTTGAACGCCCTGCAAAGGAAAAACATATCCGAGCTATATTCGCGGATATTTCAGCAACCATGCCTAGAGCTGATGTTGCATATTGCATCCAAGCTTTGGCAAGAAGATTATCCAAGACTCATAATTGGGCAGTTGCATTAAAAACTTTGGTTGTTATCCATCGTGCTTTACGGGAAATGGACCCCACATTTCGTGAAGAACTCATTAACTCTGGCAGGAGAAGAAGCCACATGCTTAATTTAGCTCATTTTAAAGACGATTCCAGTGCTAAGGCTTGGGATTATTCTGCTTGGGTACGTTCATATGCCTTATTTTTGGAGGAGAGGTTGGAATGTTTCCGTGTACTGAAGTACGATGTCGAGGCAGATAGTGTGAGAACCAAAGATCTAGACACTGCCGAGTTGCTTGAGCAGATGCCAGCATTACAAGAGCTTCTGTATCGCGTACTTGGATGTCAGCCACAAGGAGCTGCAGTTCATAATTTTGTAATTCAGCTAGCCCTTTCATTGGTTGCTTCTGAAAGCGTCAAAATTTATCAGGCTATCAGTGATGGTACTGCCAATTTAGTTGACAAGTTTTTTGAGATGCAACCCCAAGATGCAATGAAAGCCCTGGATATATACAGGAGGGCTGGCCAGCAGGCAGAAAGGCTCTCTGAATTCTATGAAGTTTGTAAAAGTCTCTATATTGGGCGTGGCGAGAAGTTTATAAAGATTGAACAGCCTCCTGCATCATTTTTACAAGCCATGGAAGATTATGTACGAGAAGCTCCACGAGTTTCGGCAGCTCGTAAGGATCAGCAGACTGCTGGTAATAAAGTAGCTGCCCCTAAAGAAGTTCTGGCTGTTGAGGACAAGAAGGAACCAGAAGTGCAAACGGAACAACCAGTGACACCTCCACCAGCTGCGTCTCCGCCACCCCCTGAACCAGTAAAAGTAGAACCAGTCGTGACTGAACAACCTGATTTATTGGGTTTGAATGATCCTGTAGCTGAGACCACTTCCAATTTAGATGAGAAGAATTCTTTGGCGTTGGCTGTTGTCCCAGTTGCCGACCAACAAACCAGTTCTGCTCCAAGCCAAGTTAATGGTACTACAACTACAGGCTGGGAATTGGCACTTGTTACGGCACCAAGCACAAATGAAAGTGTAGCTGCTACAAGCAAATTGGCCGGAGGGTTGGACTTGCTTACATTAGACAGCTTATATGATGATGCAATCAGAAGAAATAATCAGAACGTGAGTTACAATCCATGGGAGCCAGTTCCAATGCCCGGTACCATGATGCAACAGCCAATCCATGATCCCTTTTTCTCCTCAACTGTGGTGACTGCACCTCATTCAGTACAAATGGCAGCCATGGCCAACCAGCAGCAAGCTTTCATATTTCAACAGCAGCAGCAGATGATGATGATGGCTCCTTCGCAACAGTCGAATCCTTTCGGAAATCCTCATGGGACCAATGCCTACCACTACAATCCGGGTATGCCTGTTCACGCTTCCAATCCTTTTACTGGTCTCATTTAA
Protein:  
MSGGGTQNSLRKALGALKDTTTVSLAKVNSDYKELDIAIIKATNHVERPAKEKHIRAIFADISATMPRADVAYCIQALARRLSKTHNWAVALKTLVVIHRALREMDPTFREELINSGRRRSHMLNLAHFKDDSSAKAWDYSAWVRSYALFLEERLECFRVLKYDVEADSVRTKDLDTAELLEQMPALQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTANLVDKFFEMQPQDAMKALDIYRRAGQQAERLSEFYEVCKSLYIGRGEKFIKIEQPPASFLQAMEDYVREAPRVSAARKDQQTAGNKVAAPKEVLAVEDKKEPEVQTEQPVTPPPAASPPPPEPVKVEPVVTEQPDLLGLNDPVAETTSNLDEKNSLALAVVPVADQQTSSAPSQVNGTTTTGWELALVTAPSTNESVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNPWEPVPMPGTMMQQPIHDPFFSSTVVTAPHSVQMAAMANQQQAFIFQQQQQMMMMAPSQQSNPFGNPHGTNAYHYNPGMPVHASNPFTGLI